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gvSTFM ANT> MET Pnn FOR DFXIVFRTNG CUSTOMIZED 
voirF. AUDIO DATA ON A PACK ET-SWITCHED NETWORK 



RArKCxROUND r>P THF mVENTION 

Field of the Invention 

Hie present invention relates to the distribution and customization of voice audio data 
products, like telephone signals, via a packet-switched network sudi as networks based on the 
Intemet Protocol (IP). 



Descri ption of Related Art 

The hearing profiles of individuals vary in a number of ways. The ability to hear sounds 
varies with ftequency among individuals across the normal audio fi«quency range. Also, the 
dynamic range varies among individuals S50 that levels of an audio stunulus that are perceived as 

15 soft sounds and levels of an audio stimulus that are perceived as loud sounds differ firan person 
to person. Standard hearing tests are designed to produce an audiogram that characterizes such 
factors as frequency, sensitivity and dynamic range in the hearing profiles of individuals. There 
are also other factors that affect a hearing profile. For example, psycho-acoustic fectors 
concerning the manner in which a person perceives combinations of normal sounds affect Ae 

20 ability to hear in ways that can vary from person to person. Also, environmental fectors such as 
the usual listening environment of a person Oibrary, conference room, concert hall) and the 
equipment on which the sound is produced Goud speakers, ear phones, telephone hand set) are 
importanL In persons wearing hearing aids or using other assistive hearing devices, the type of 
aid or device affects the hearing profile. The physiology of an impairmrait suffered by the 

25 individual may also be an important fectcMrinlhe hearing profile. 

The hearing profiles of individuals have been aqppKed in the hearing aid field for 
customizing and fitting hearing aids fijr individuals. See, for example, U.S. Patent No. 4,73 1 ,850 
entitled PROGRAMMABLE DIGITAL HEARING AID SYSTEM, invented by Levitt et al.; and 
U.S. Patent No. 5,848,171 entitled HEARING AID DEVICE INCORPORATING SIGNAL 

30 PROCESSING TECHNIQUES, invented by Stockham, Jr. et al. Thus, techniques for processing 
sound to ofi&et variations in hearing are well known. However, these techniques arc unavailable 



-1- 



r 



PCT/USOO/26304 

WO 01/24462 

to persons not using hearing aids. Furthennore. many persons who could benefit fiom such 
processing are not in positions to use hearing aids for a variety of reasons. 

Telephone signals are distributed in the public switched telephone networks (PSTNs) in a 
formintendedforpersonshavinghearingprofileswithinanormalrBiige. Proposals for applying 

5 custom processing of telephone signals in the PSTO according to a customer's hearing 

in^ainnenthavebeenpresented. See. for example, U.S. Patent No. 5.388,185 entiUed SYSTEM 
FOR ADAPTIVE PROCESSING OF TELEPHONE VOICE SIC5NALS, invented by Terry et al. 
Such proposals have been Ihnited to dicmt-switched telephone networks with limitations 

inherent in such sjrstems. 

10 Techniques axe being developed for the transmission of real time voice over IP based 

networks, which are sometimes referred to as Voice over Internet Protocol (VoIP) technologies. 
One leading VoIP technology is based on ITU-T Recommendation H.323, "Packet-Based 
Multimedia Communications Systems." firom Ae International Telecommunication Union which 
describes a set of protocols usefiil for supporting real time audio and video communication over 

15 IP networks. 

Aocordin^y, it is desirable to provide systems and methods to apply techniques for the 
processing of real time audio data according to hearing profiles for the benefit of individuals who 
are not wearing hearing assistance devices. Also it is desirable to provide tools to simplify die 
gathering of information needed to develop hearing profiles of mdividuals, and to apply die 
20 gathered information in the distribution and customization of real time voice signab. 



RTT MMARY OF T^F INVENTION 
The prestait mvention provides customized audio for VoIP sessions, or other packet- 
25 switehed networis for carrying voice, and systems and mediods for producing the customized 
audio data for sudi sessions, processed according to a hearing profile of the participants. 
According to the present invention, the quality of VoIP telephone calls and other voice sessions 
on packet-switched networks is improved. 

Thus, the present invention provides a processor coupled to a packet-switched 
30 communication network which processes a flow of packets carrymg voice data according to die 
hearing profiles of the intended recipients. The flow of packets in various embodiments is 



-2- 



f 



WO01/244«2 PCT/US00/2«3M 

compliant with a standard Internet Protocol. In one embodiment, the flow of packets is 
compliant with ITU-T Recommendation H.323. The hearing profile data is stored in one 
embodiment with a registry of users. Resources are included in this embodiment to associate 
data from flie registry characterizing a hearing profile of the user with a flow of packets in the 
5 padket-switdied network for use and customizing the voice data. 

Ihe invention includes a method for communication of voice signals in a packet-switched 
network that includes detecting a flow of packets carrying the voice signals on the network to a 
recipient, utilizing data within the packets. In addition, the method includes obtaining a hearing 
profile of the recipient from memory on at least one of an intermediate station or an end station 

10 in the network. Finally, the flow of packets is processed according to the hearing profile m a 
processor on one of an intermediate station and an end station in the networic A typical session 
on a packet-switched netwoA carrying voice includes a first flow fiom a first participant in the 
call to the second adjustment in the call, and a second flow firom the second participant to the 
first. The processing of the first and second flows can be positioned in different locales in the 

15 network. For example, a flow to a particular participant may be processed in a processor within 
the local area network of the participant Data sent fmm the other participant in the session is 
delivered to the processor within the local area networic of the participant via the Internet for 
processing prior to delivery. In Has manner, the sending station need not have the hearing profile 
data available for the intended recipients, hi alternative systems, the processing of hearing 

20 profile data is executing a single servo: within tiie network which stores the hearing profiles of 
the redpients of the data. 

Accorxiing to anotiier embodimrait of Ae invention, the method includes establishing a 
i^istry of users in a processor coupled to the network. The registry includes parameters 
indicating wheflier processing according to a hearing profile of a user is enabled fi>r fliat usct, 

25 and other parameters necessary fin- establishing the session and providing a hearing profile fi>r 
use within the session. Accordingly, a processor detects a flow of packets carrying voice signals 
on the networic to a recipient, the packets bemg identifiable by parameters witiiin flie packets as 
members of the flow. Next, the processor accesses the registry in response to tiie flow and if the 
recipient is a n^sttared user, a hearing profile is obtained using parameters available from the 

30 registry fiom memory in tite networic. Fmally, the flow of packets is processed according to a 
hearing profile. 
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In another embodiment, the invention includes maintaining the registry in a gate keeper 
processor on the network, wherein the gate keeper processor performs functions compliant with 
ITU-T Recommendation H.323. 

Accoidingly, the present invention provides for die customization of voice sessions in 
5 packet-switdied networks. A customization of voice sessions requires the management of user 
registties and of processing resources necessary for matching individual voice sessions with data 
relevant to the participants in the sessions. The present invention, by taking advantage of the 
power and flexibiUty of packet-switched networks such as the Internet, makes the deUvery of 
voice sessions customized according to individual hearing profiles practical and cost-effective. 
10 Other aspects and advantages of the present invention can be seen on review of the 

drawings, tihe detailed desaciption, and the claims whadi follow. 

Rl^^F nPSPRTPTTON OF THE FIGURES 
Fig. 1 illustrates a network serving voice over the Internet protocol with customized 
15 processing according to the hearing profile of users according to the present invention. 

Fig. 2 is a simpUfied flow drart illustrating Ac processing at a locale in the network of a 
voice over the Internet Protocol session with user hearing profiles. 

Fig. 3 is a simplified block diagram of an H.323 standard network enhanced with hearing 
profile processing according to the present invention. 
20 Fig. 4 iUustrates one example entry in a registry of hearing profiles for the system of Fig. 

3. 

Fig. 5 is a simpUfied flow chart of a process for establishing an H.323 call with 
customized processing according to the present invention. 

Fig. 6 is a diagram of one example of a hearing profile for use in producing customized 

25 audio for VoIP sessions. 

Fig. 7 is a diagram of another example of a hearing profile for use in producing 

customized audio for VoIP sessions. 

Fig. 8 is a diagram of yet another example of a hearing profile for use m producing 
customized audio for VoIP sessions. 



30 
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A detailed description of the various embodiments of the present invention is provided 

with referaice to Figs. 1 tfirou^ 8. 

Fig. 1 provides an illustration of one example network supporting VoIP telephone calls 
with audio data processed according to die hearing profiles of the participants in the call. The 
5 example network includes local area network 10 and local area network 1 1 which are coupled to 
one another flirough an Internet protocol network 12. Also, a gateway server 13 is coupled to the 
Internet protocol network 12 and to Ae public switched telephone network PSTN 16. Local area 
networic 10 includes VoIP server/router 14 which couples the local area netwodc 10 to the 
Internet protocol network 12. Local area network 1 1 includes a VoIP server/router 1 7 which 

1 0 couples the local area network 1 1 to the Internet protocol network 12. 

End stations illustrated in the example of Fig. 1 include the telqjhones 20, 21, 22 coupled 
to the local area network 10, and the voice enabled personal computer 23 coupled to the local 
ar&i networic 10. Also, the stations coupled to the local area networic 1 1 include telephones 25 
and 26, and voice or multimedia enabled personal computers 27 and 28. hi the public switched 

15 telephone network 16, a central oflSce switch 30 connects end station telephones 31 and 32 into 
the network. 

The intermediate station servers 14 and 17 are coupled to hearing proffle databases 15 
and 18 to support processing of VoIP sessions according to the hearing profiles of the 
participants in the call. Thus for example, when a call is established between end station 28 on 

20 local area networic 1 1, and end station 22 on local area network 10, the VoBP server/router 17 

associates the hearing profile of the recipient using end station 28 with the incoming packet flow 
of die session, and processes die audio data using die hearing profile. Likewise, the VoIP 
starver/router 14 coiqpled to die local area network 10 associates die hearing profile of tiie 
recipient using Ae telephone 22 widi die incoming packet flow of die session, and processes die 

25 audio data using die hearing profile as it is ddivered to die telephone 22. In a similar manner, 
when a VoIP session is establidied between an end station, sudi as end station 25 on local area 
networic 1 1, and a telephone coupled to die public switched telephone networic 16, such as 
telephone 32, die VoIP gateway 1 3 associates the hearing profile of die recipient at die telephone 
32 widi incoming packets in the VoIP session, and processes the data using die hearing profile. 

30 The processed data is dien re-formatted for die circuit switched public switched telq>hone 
network fiw delivery to the recipient at die telephone 32. 
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The hearing profile databases 15, 18, 19 illustrated in the example of Fig. 1 maintain 
registries of users of the VoIP service. The registries contain the necessary information to 
support the processing of the packet flows in the VoIP sessions utilizing die hearing profiles of 

usKS having oatries in the databases. 

5 The establishment and maintenance of the hearing profUe databases in the packet- 

switched network can be handled using a variety of network architectures, taking full advantage 
of the packet-switched network flexibility. For example, a central server may be used for 
acquiring, updating, and delivering hearing profiles for use in the VoIP sessions supported by the 
database. Alternatively, distributed servers, each maintaining hearing profile data fi>r a limited 

10 number of users could be utilized which process only incoming packet flows. In yet another 
alternative, the hearing profiles may be stored locally on the end stations, and no intermediate 
station need be involved in the processing of the packet flows according to the hearing profiles. 
A combination of locales in the network may be used to siqjport a particular session, according to 
file capability of the stations involved in the session. 

15 A basic process executed at a locale in the packet-switched network according to the 

present invention fi>r packet flows in a VoIP caU is iUustrated in Fig. 2. The locale in the 
networic either an end station or an mtetmediate station, detects a Voff session by filtering Ae 
packet flow to detect headers or other control infi)rmation tagging the flow as a VoIP session 
(step 50). Upon detection of a VoIP session, a profile of a recipient at tiie destination end station 

20 is retrieved from memory (step 51). Next, the packet flow is processed according to flxe hearing 
profile of the recipient (step 52). Fmally, the processed packets are forwarded to the destination 

end station (step 53). 

As mentioned above, one popular suite of protocols developed for packet based 
rnultimedia communication systems such as Internet protocol based networic systems used fi>r 

25 VoIP sessions is known as ITU-T Recommendation HJ23. Recommendation H.323 is 

incorporated by reference as if fiilly set fortii herein. Fig. 3 provides a simplified illustration of 
an H.323 network enhanced according to the present invention. In tiie network of Fig. 3, an 
H.323 gatekeeper process 1 00 is coupled to the Internet 101. H.323 compliant end stations 102 
and 103 are also coupled to the Internet 101. The H323 compliant gateway 104 is coupled to 

30 end stations, such as end station 105 which support other protocol suites for die VoIP session. 
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The gate keeper process 100 includes an H.323 registry 106 which is enhanced according 
to the present invention with a hearing profile database. TTius, for example, entries in the R323 
registry 106 have a fonnat such as shown in Fig. 4. The fonnat includes a user identifier 120, 
and H,323 call parameters 121 which are developed according to Ihe standard. A pointer to a 

5 hearingprofile.orahearingprofileitselfisstoredinthe field 122. The hearing profile 

processing capability for the user is stored as a parameter 123. A few examples of the hearing 
profile pointer 122 include a URL, a memory address in local memory on the gate keeper, or a 
memory address in apaiticular storage device having another format. The hearing profile 
processing capability parameter 123 indicates where the processing is available for applying the 

10 hearing profile. Thus, par^eter 124 indicates that digital signal processing resources are 

available locally on an end station of the user (e.g., end station 102), parameter 125 indicates that 
digital signal processing resources are available on a gateway 104 through which the user 
accesses the network, and parameter 126 indicates that digital signal processing resources at the 
gate keqper 100 ar« to be used for packet flows destined to the identified user. 

15 Fig. 5 provides a simpUfied flow chart of the processing at the gate keeper 100 of Fig. 4 

enhanced accordmg to present invention. During call establishment, the gate keeper detects an 
H.323 caU set up session (step 150). The hearing profile processing capability for the identified 
end stations is determined using die infistmation in the registry (step 15 1). Next, the hearing 
profile or hearing profiles involved in the session are forwarded to the processors tiiat are 

20 associated with the registered users. Alternatively, the hearing profile is simply retrieved fiom 
the registry for use at the gate keeper processor itself (step 1 52). The H.323 call set up 
procedures estabhsh call paths for each du-ection, taking into account the locale in the network 
for processing of d»e audio signals with the hearing profiles of the users (stqp 1 53). Finally, the 
call setup is completed (step 154). 

25 Figs. 6, 7 and 8 illustrate alternative formats for die hearing profiles to be stored in the 

registry 106, or databases 15, 17, 19 in die examples shown in die Figures. The hearing profile 
shown in Fig. 6 includes a customer ID 250, coefiBdents of transform equations 251, and an 
identifier of the playback device type and Ustening environmait 252 in which the audio data 
product is to be played. In Fig. 7, die hearing profile includes a customer identifier 260, die 

30 audiogram for customer 261, listening condition data 262, and psycho-acoustic parameters 263 
whidi rdate to die listening characteristics of die customer. The hearijig profile shown in Fig. 8 
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includes a customer identifier 270, and software 271 along with code date structures 272 that in 
combination provide executable transform code for producing the customized audio data product 
jftom the selected audio data product. Tlie three examples of hearing profile formats shown in 
Fi^. 6-8 are representative of a large number of hearing profile formats that could be utilized 

5 depending on the type of transfbim processes being executed, the type of audio products being 
delivered, and o&er fectors related to the architecture of the system. 

The system in various embodiments provides input tools, sudi as a graphical user 
interfece usable, for example, at a kiosk or accessible via a network, for accepting input data 
concerning a hearing profile of a customer, either supported by a specialized hearing test device 

10 orbyotherresourcesforuseinthetest. The input tools are used for creating a machine readable 
hearing profile, which is stored in a registry of hearing profiles for use in the production of 
customized audio data products. Alternatively, the hearing profile is stored on a portable data 
storage medium for use by the customer, as described above. The related applications 
incorporated by reference above, include detailed descriptions of technologies for conducting 

15 hearing tests and for gafliering hearing profile data. 

Also, in various embodiments of the invention, the system provides tools for accepting 
input data characterizing the feedback ftom the customer. Processing resources are provided for 
modifying the machine readable hearing profile in response to the feedback data. The modifying 
of the machine readable hearing profile in one aspect of the invention comprises applying an 

20 optunization modeling process to the hearing profile, by which the profiles are improved over 
time using feedback arising fiom a variety of customized audio data products and ftom a variety 
of Ustening conditions. The related appUcations, incorporated by reference above, include 
detaUed descriptions of tedmologies for gathering feedback and optimizing hearing profile data. 
While the present invention is disclosed by reference to the preferred embodiments and 

25 examples detailed above, it is to be understood that these examples are intended in an illustrative 
ra&er than in a limiting sense. It is contemplated that modifications and combinations wUl 
readily occur to those skilled in the art, which modifications and combinations will be within the 
spirit of the invention and the scope of the appaided claims. 
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CLAIMS 

What is claimed is: 

1 1 . For a packet-switched communication network, a station comprising: 

2 a processor coupled to the network whidi processes a flow of packets canying voice dafc 

3 according to hearing profiles of intended recqrients. 

1 2- The station of claim 1, wherein the flow of packets comprises packets compliant 

2 with a standard Internet Protocol. 

1 3. The station of claim 1, wherein the flow of packets comprises packets compliant 

2 with rrU-T Recommeaadation H.323. 

1 4. The station of claim 1, including a registry of users, and including resources to 

2 associate data ftom the registry characterizing a hearing profile of a user in the registry widi the 

3 flow of packets. 

1 5. A method for communication of voice signals in a packet-switched networic, 

2 comprising: 

3 detecting a flow of packets carrying the voice signals on Ae networic to a recipient, the 

4 packets identifiable as members of the flow; 

5 obtaining a hearing profile of the recipient from memory on at least one of an 

6 imermediate station and an end station in the network; and 

7 processing die flow of packets according to the hearing profile in a processor on at least 

8 one ofanintamediate station and an end station in the network. 

1 6. Tliemefliodofclaim 5, including filtering packets received in tiie processor to 

2 detect members of the flow. 

1 7. The method of claim 5, wherein the flow originates at an originating end station 

2 ID the networic, and die memory is on die originating end station, 

-9- 
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1 8. The method of claim 5, wherein the memory is on an intermediate station in the 

2 networic 

1 9. The method ofclaim 5, wherein the flow is transmitted to a destination end 

2 station in the network, and the memory is on the destination end station. 

1 10. A mediod for communication of voice signals in a packet-switched network, 

2 comprising: 

3 establishing a registry of users in a processor coupled to Ae network, the registry 

4 including parameters indicating whether processing according to a hearing profile of a user is 

5 enabled for the user; 

6 detecting a flow of packets carrying the voice signals on the network to a recipient, the 

7 padfietsideatifiable as members of the flow; 

g accessing the registry in response to the flow, and if the recipient is a registered user, 

9 obtaining a hearing profile of the recipient fiom memory in the network; and 

10 processing the flow of padcets according to the hearing profile in a processor in the 

11 network- 

1 1 1 . The mediod of claim 1 0, including filtering packets in the processor in the 

2 network to detect members of the flow. 

1 12. The method of clahn 10, wherein the flow originates at an origmating aid station 

2 in the netwoik, and the memory is on the originating end staltion. 

1 13. The mediod ofclaim 10, wherein the memory is on an intermediate station in flie 

2 network. 

1 14. The mediod of claim 1 0, wherein the flow is transmitted to a destination aid 

2 station in die nttwork, and the memory is on the destination end station. 



-10- 



wo 01/24462 



PCT/US00A26304 



1 15. lliemetbodof claim 10, includingmaintainingtheregistiymagatekee^ 

2 processor on the network, and wherein the gate keeper processor performs fiinctions compliant 

3 with ITU-T Recommendation H.323 . 

1 16. -niemefliodofdaim lO,whereintheparametersincludeapointertoaheaiing 

2 profile for the registered user. 

1 17. The mefliod of claim 10, wherein the processor is on an intermediate station in 1 

2 network, and including routing the flow throu^ the interaiediate station. 

1 1 8. The method of claim 10, wherein the processor is on an end station associated 

2 with the recipient, and including transmitting a copy of the hearing profile from the memory in 

3 tihe network to flie end station. 
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